On the Probability Distribution of Typological Frequencies
نویسنده
چکیده
Some language types are more frequent among the world’s languages than others, and the field of linguistic typology attempts to elucidate the reasons for such differences in type frequency. However, there is no consensus in that field about the stochastic processes that shape these frequencies, and there is thus likewise no agreement about the expected probability distribution of typological frequencies. This paper explains the problem and presents a first attempt to build a theory of typological probability purely based on processes of language change. 1 Probability distributions in typological research A central objective of the typological study of linguistic diversity is to explain why certain kinds of linguistic structures are much more frequently attested among the world’s languages than others. Unfortunately, such interpretations of empirically attested frequencies often rely on purely non-mathematic intuitions to judge whether observed frequencies are in any sense noteworthy or not. In the typological literature, this frequently leads to a tacit assumption that typological frequencies are evenly distributed, i.e. that a priori all language-types should be equally frequent, and any observed skewing of frequencies is thus in need of an explanation. Such an argumentation can be found, for example, in the widely read typological textbook by Comrie [2]: “In a representative sample of languages, if no universal were involved, i.e. if the distribution of types along some parameter were purely random, then we would expect each type to have roughly an equal number of representatives. To the extent that the actual distribution departs from this random distribution, the linguist is obliged to state and, if possible, account for this discrepancy” (p. 20) Various more sophisticated approaches to the interpretation of empirical frequencies often assume an underlyingly normal (or, more precisely, multinomial) distribution, as for example indicated by the regular use of χ statistics or Fisher’s exact test (e.g. in Cysouw [3]). Janssen et al. [6] and Maslova [12] explicitly discuss the problem of tacit assumptions of the underlying probability distributions as made in linguistic typology. As a practical solution to circumvent this problem for the assessment of statistical significance, Janssen et al. [6]
منابع مشابه
Probability Model of Allele Frequency of Alzheimer’s Disease Genetic Risk Factor
Background and Purpose: The identification of genetics risk factors of human diseases is very important. This study is conducted to model the allele frequencies (AFs) of Alzheimer’s disease. Materials and Methods: In this study, several candidate probability distributions are fitted on a data set of Alzheimer’s disease genetic risk factor. Unknown parameters of the considered dis...
متن کاملAssessment of Goodness of Fit Methods in Determining the Best Regional Probability Distribution of Rainfall Data
One of the most important problems in time series analysis of stream flow and rainfall data in an area is selecting the best probability distribution. Since the rainfall stations are associated and correlated with each other, so statistical analysis of the station data seamlessly are very important. Therefore, the first step in data analysis, is selecting the prevailing probability distribution...
متن کاملVelocity Distribution in the 90-degree Bend based on the Probability and Entropy Concept
Practical concept of velocity distribution of pressure flow in the bends is interesting and hence, the professional engineering design has been investigated in the current study. This paper shows that velocity distribution in the bends can be analyzed in terms of the probability distributions. The concept of entropy based on the probability is an applied and new approach to achieve velocity pro...
متن کاملMirror Neurons and (Inter)subjectivity: Typological Evidence from East Asian Languages
Language is primarily constituted by action and interaction based on sensorimotor information. This paper demonstrates the nature of subjectivity and intersubjectivity through the neural mechanism and typological evidence of sentence-final particles from East Asian languages and extends to the discussion of the relationship between them. I propose that intersubjecivity is a kind of embedded or ...
متن کاملUsing Weibull probability distribution to calibrate prevailing wind applying in oil spill simulation
In the Persian Gulf, the major source of oil pollution is related to the transportation of tankers, offshore production and discharges by coastal refineries. The water dynamical field has been obtained using a new hydrodynamic model. Local wind is recognized as the principal driving force combining to the water dynamic field to determine oil drift on the sea surface. The Weibull probability dis...
متن کامل